Fpga-based Rectification and Lens Undistortion for a Real-time Embedded Stereo Vision Sensor
نویسندگان
چکیده
Modern applications such as robot assembling automation, upcoming mobile robot platforms for homes, and car safety features require both, 3D perception and object classification, for navigation and object manipulation [9]. Real-time performance of approximately 30 frames per second is mandatory and a very critical design issue. Embedded stereo vision sensors, consisting of a sensor head and a calculation unit, are very well suited for stereoscopic perception but require huge computational effort. Due to mounting tolerances within the sensor head, resulting in a maximum relative shift and revolution of these two camera images, rectification is absolutely necessary to reduce the matching effort. In our employed method it is done by applying a remap with offline calculated coefficients. The camera lenses also have an impact on the source image, resulting in a distortion in border areas which is reversed by a second remap. By undistorting and rectifying the original camera images, the computational effort for subsequent tasks like stereo matching is dramatically reduced, thus setting a very important step to achieve real-time performance. Approaches implementing lens undistortion and rectification on DSP based platforms or general purpose CPU based platforms are very often insufficient [6]. Considering reduced power usage and small form factors, general purpose CPUs (even with multimedia extensions) are outperformed by Digital Signal Processors (DSPs) and their massive parallel architecture [4]. Also the DSP’s resource usage, including computation and memory transfers for this correction process, is far too high and nearly leaves no evaluation time for high level applications. High memory bandwidths, low memory latencies, and the overall resource consumption inhibit the usage of DSPs for this application [2]. The presented solution to overcome this bottleneck is a dedicated hardware implementation, bypassing the disadvantages of common platforms. Current Field Programmable Gate Array (FPGA) systems deliver enough resources and performance, by reduced form factors to fulfill all necessary requirements [7],[10]. Their tightly coupled memory architecture, and their possibility to flatten designs qualifies them for image processing [11]. Therefore, a generic IP-Core with a simple input/output image interface, a parameter cache interface for the combined undistortion and rectification coefficients and minimized FPGA utilization is designed and tested on an existing platform. By keeping this core generic, it offers a nearly seamless integration and reusability for future projects, too. This paper is organized as follows: Section 3 shortly briefs on the basics and necessities of rectification and undistortion. Sections 4 and 5 present the proposed architecture, describe the target oriented implementation and synthesis results. The conclusion is supplemented with techniques for eliminating the cache architecture by on-the-fly calculation of the remap coefficients.
منابع مشابه
An FPGA-based processing pipeline for high-definition stereo video
This paper presents a real-time processing platform for high-definition stereo video. The system is capable to process stereo video streams at resolutions up to 1, 920 × 1, 080 at 30 frames per second (1080p30). In the hybrid FPGA-GPU-CPU system, a high-density FPGA is used not only to perform the low-level image processing tasks such as color interpolation and cross-image color correction, but...
متن کاملDesign and Implementation of an FPGA-based Stereo Vision System for the EyeBot M6
The goal of this thesis is the design and implementation of an FPGA based stereo vision system on the EyeBot M6. The research platform named EyeBot M6 as the successor of M4 is built to allow the realization of stereo vision tasks for small robots. Stereo vision uses two cameras separated by distance and a process called triangulation to determine the depth information of the environment. Trian...
متن کاملField Programmable Gate Array–based Implementation of an Improved Algorithm for Objects Distance Measurement (TECHNICAL NOTE)
In this work, the design of a low-cost, field programmable gate array (FPGA)-based digital hardware platform that implements image processing algorithms for real-time distance measurement is presented. Using embedded development kit (EDK) tools from Xilinx, the system is developed on a spartan3 / xc3s400, one of the common and low cost field programmable gate arrays from the Xilinx Spartan fami...
متن کاملTowards Hardware Accelerated Rectification of High Speed Stereo Image Streams
The process of combining two views of a scene in order to obtain depth information is called stereo vision. When the same is done using a computer it is then called computer stereo vision. Stereo vision is used in robotic application where depth of an object plays a role. Two cameras mounted on a rig is called a stereo camera system. Such a system is able to capture two views and enable robotic...
متن کاملStereoscopic 3D Reconstruction using Motorized Zoom Lenses within an Embedded System
This paper describes a novel embedded system capable of estimating 3D positions of surfaces viewed by a stereoscopic rig consisting of a pair of calibrated cameras. Novel theoretical and technical aspects of the system are tied to two aspects of the design that deviate from typical stereoscopic reconstruction systems: (1) incorporation of an 10x zoom lens (RainbowH10x8.5) and (2) implementation...
متن کامل